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(57) ABSTRACT 

In response to receiving a request to perform an enqueue or 
dequeue operation a corresponding queue descriptor speci- 
fying the structure of the queue is referenced to execute the 
operation. The queue descriptor is stored in a processor's 
memory controller logic. 
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QUEUE ARRAY CACHING IN NETWORK 
DEVICES 

BACKGROUND 

[0001] This invention relates to queue arrays for use in 
network devices. 

[0002] Network devices such as routers and switches can 
have line speeds that can be faster than 10 Gigabits. For 
maximum efficiency the network device should be able to 
process data packets, storing them to and retrieving them 
from memory at a rate at least equal to the line rate. 
However, current network devices may lack the necessary 
speed to process data packets at the line speeds. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0003] FIG. 1 is a block diagram of a network system. 

[0004] FIG. 2 is a block diagram of a network device. 

[0005] FIG. 3 shows a queue and queue descriptor. 

[0006] FIG. 4 illustrates an enqueue and a dequeue opera- 
tion. 

DETAILED DESCRIPTION 

[0007] Referring to FIG. 1, a network system 2 for 
processing data packets includes oue or more sources 4 of 
data packets coupled to a network device 6 and one or more 
destinations 8 for the data packets. Each source 4 can 
include other network devices connected over a communi- 
cations path operating at high data packet transfer line 
speeds. Examples of such communications paths include an 
optical carrier (OC)-192 line, and a 10-Gigabit line. Like- 
wise, the destinations 8 also can include other network 
devices, as well as a similar network connection. 

[0008] The network device 6 includes a processor 10 that 
uses a memory (not shown) storing memory data structures. 
The processor executes instructions and operates with the 
memory data structures as configured to receive, store and 
forward the data packets to a specified destination. The 
network device 6 can be part of, a network switch or a 
network router and so forth. The processor 10 also includes 
one or more programming engines. The programming 
engine ("PE") includes a sixteen-entry content addressable 
memory ("CAM"). The CAM tracks, which of its entries is 
the least-recently-used ("LRU"). 

[0009] Referring to FIG. 2, the network device 6 includes 
memory 14 coupled to the processor 10. The memory 14 
stores output queues 18 and their corresponding queue 
descriptors 20. The processor 10 includes memory controller 
logic 38 that includes a cache 12 to store some of the queue 
descriptors 20 as described below. The processor 10 also has 
a queue manager 42 that can be implemented as a program- 
ming engine. A CAM 44 serves as a tag store holding the 
addresses of queue descriptors 20 that are stored in the 
cache. 

[0010] The queue manager 42 receives enqueue requests 
from a set of programming engines that function as a receive 
pipeline 46. The receive pipeline 46 is programmed to 
process and classify data packets received by the network 
device 6 from sources 4 (FIG. 1). The enqueue requests 
specify which output queue 18 an arriving packet should be 



added to. Another programming engine functions as a trans- 
mit scheduler 48 to send dequeue requests to the queue 
manager 42. The dequeue requests specify the output queue 
18 from which a packet is to be removed for transmittal to 
a destination 8 (FIG. 1). 

[0011] An enqueue operation adds information that 
arrived in a data packet to one of the output queues 18 and 
updates the corresponding queue descriptor 20. A dequeue 
operation removes information from one of the output 
queues 18 and updates the corresponding queue descriptor 
20, to allow the network device 6 to transmit the information 
to the appropriate destination 8. 

[0012] An example of an output queue 18 and its corre- 
sponding queue descriptor 20 is shown in FIG. 3. The output 
queue 18 includes a linked list of elements 22, each of which 
contains a pointer 24 to the next element 22 in the output 
queue 18. The pointer 26 of the last element 22 in the queue 
18 contains a null value. A function of the address of each 
element 22 implicitly maps to the information 26 stored in 
the memory 14 that the element 22 represents. For example, 
the first element 22a of output queue 18 shown in FIG. 3 is 
located at address A. The location in memory of the infor- 
mation 26a that element 22a represents is implicit from the 
element's address A, illustrated by dashed arrow 27a. Ele- 
ment 22a contains the address B, which serves as a pointer 
24 to the next element 22b in the output queue 18, located 
at address B. 

[0013] The queue descriptor 20 includes a head pointer 28, 
a tail pointer 30 and a count 32. The head pointer 28 points 
to the first element 22 of the output queue 18, and the tail 
pointer 30 points to the last element 22 of the output queue 
18. The count 32 identifies the number (N) of elements 22 
in the output queue 18. 

[0014] Executing enqueue and dequeue operations for a 
large number of queues 18 in the memory 14 at high- 
bandwidth line rates can be accomplished by storing some of 
the queue descriptors 20 in the cache 12 (FIG. 2). The queue 
manager 42 implements a software-controlled tag store in its 
CAM 44 to identify the addresses in memory 14 of the 
sixteen queue descriptors 20 most-recently-used in enqueue 
or dequeue operations. The cache 12 stores the correspond- 
ing queue descriptors 20 (the head pointer 28, tail pointer 30 
and count 32) stored at the addresses identified in the tag 
store 44. 

[0015] The queue manager 42 issues commands to return 
queue descriptors 20 to memory 14 and fetch new queue 
descriptors from memory such that the queue descriptors 
stored in the cache 12 remain coherent with the addresses in 
the tag store 44. The queue manager 42 also issues com- 
mands to the memory controller logic 38 to indicate which 
queue descriptor 18 in the cache 12 should be used to 
execute the command. The commands that reference the 
head pointer 28 or tail pointer 30 (see FIG. 3) of a queue 
descriptor 20 in the cache 12 are executed in the order in 
which they arrive at the memory controller 38. 

[0016] Referring to FIG. 4, when performing an enqueue 
operation, the address in memory 14 of a new element 22* 
to be added to the queue 18 is stored (as indicated by dashed 
line 40) in the pointer 24d of the element 22d that currently 
is at the address indicated by the tail pointer 30 for that 
queue. The address of the new element 22c address then is 
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stored in the tail pointer 30 of the corresponding queue 
descriptor 20 in the cache 12, as indicated by dashed line 31. 
Because only a single write operation to memory 14 is 
required for an enqueue operation, only two cycles are 
required to update the cache 12. Subsequent enqueue opera- 
tions to the same queue 18 then can be initiated. 

[0017] For dequeue operations, the address contained in 
the head pointer 28 is returned to the queue manager 42 
(FIG. 2) to indicate (by implicit mapping) the location in 
memory 14 of the information 26e to be sent to a specified 
destination device 8 (FIG. 1). The pointer 24a in the element 
22a is read to obtain the address of the next element 22b in 
the queue 18. The address of next element 22b is written to 
the head pointer of the corresponding queue descriptor 20 in 
the cache 12 (indicated by dashed line 29). Subsequent 
dequeue operations to the same queue 18 are delayed until 
the head pointer 28 in the cache 12 is updated. However, so 
long as the element 22 being read is not the only element in 
the queue 18, an enqueue operation with respect to the queue 
18 can proceed even if a dequeue operation is in progress 
because the tail pointer 30 is not affected by the dequeue 
operation. 

[0018] An advantage of locating the cache 12 of queue 
descriptors 20 at the memory controller logic 38 includes 
allowing for low latency access to and from the cache 12 and 
the memory 14. Also, having the control structure for queue 
operations in a programming engine can allow for flexible 
high performance while using existing micro-engine hard- 
ware. 

[0019] Various features of the system can be implemented 
in hardware, software or a combination of hardware and 
software. For example, some aspects of the system can be 
implemented in computer programs executing on program- 
mable computers. Each program can be implemented in a 
high level procedural or object-oriented programming lan- 
guage to communicate with a computer system. Further- 
more, each such computer program can be stored on a 
storage medium, such as read only memory (ROM) readable 
by a general or special purpose programmable computer, for 
configuring and operating the computer when the storage 
medium is read by the computer to perform the functions 
described above. 

[0020] Other implementations are within the scope of the 
following claims. 

What is claimed is: 

1. A method executed in a processor comprising: 

receiving a request to perform an enqueue or a dequeue 
operation with respect to a particular queue; and 

referencing a corresponding queue descriptor stored in a 
cache in a processor's memory controller logic to 
execute the operations, the queue descriptor specifying 
a structure of the queue. 

2. The method of claim 1 further comprising: 

maintaining a list of addresses of a subset of queue 
descriptors stored in a memory in a content addressable 
memory. 

3. The method of claim 2 further comprising: 

storing in the cache a queue descriptor corresponding to 
each address in the list. 



4. The method of claim 3 further comprising: 

tracking an address stored in the content addressable 
memory, the address corresponding to a queue descrip- 
tor that was least recently used for an enqueue or 
dequeue operation. 

5. The method of claim 4 further comprising: 

removing the least-recently-used address from the list if 
the list lacks an entry corresponding to the queue 
specified by the request; and 

replacing the removed address with an address corre- 
sponding to the specified queue. 

6. The method of claim 3 further comprising: 

issuing commands to the memory controller logic to 
return and fetch queue descriptors to and from the 
memory to maintain coherence between the queue 
descriptors in the cache and the list of addresses in the 
content addressable memory. 

7. The method of claim 6 further comprising: 

modifying the queue descriptor referenced by the enqueue 
or dequeue operation; and 

returning the modified queue descriptors to memory from 
the cache. 

8. The method of claim 1 further comprising: 

executing an enqueue operation without waiting for 
completion of a previous dequeue operation. 

9. An apparatus comprising: 

a memory to store queue descriptors, each of which 
specifies a structure of a respective queue; 

a network processor coupled to the memory further com- 
prising: 

memory controller logic that includes a cache to store 
a subset of the queue descriptors in the memory; and 

a programming engine that accesses a list of addresses 
in the memory corresponding to the queue descrip- 
tors stored in the cache; and 

wherein the processor is configured to reference a corre- 
sponding queue descriptor in the cache in response to 
a request to perform an enqueue or a dequeue operation 
with respect to a particular queue. 

10. The apparatus of claim 9 wherein the programming 
engine includes a content addressable memory to store the 
list of addresses. 

11. The apparatus of claim 10 wherein the content addres- 
sable memory is configured to track which address in the list 
was least recently used by the processor for an enqueue or 
dequeue operation. 

12. The apparatus of claim 9 wherein the programming 
engine is configured to: 

remove the least-recently-used address from its list of 
addresses if the list lacks an entry corresponding to the 
queue specified by the request; and 

replace the removed address with an address correspond- 
ing to the specified queue. 

13. The apparatus of claim 9 wherein the programming 
engine is configured to issue commands to the memory 
controller logic to return and fetch queue descriptors to and 
from memory to maintain coherence between the queue 
descriptors in the cache and the list of addresses in the 
programming engine. 
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14. The apparatus of claim 9 wherein the processor is 
configured to return to memory from the cache a queue 
descriptor modified by an enqueue or dequeue operation, 

15. The apparatus of claim 9 wherein the processor is 
configured to execute an enqueue operation without waiting 
for completion of a previous dequeue operation if the queue 
would otherwise be uncmpty upon completion of the 
dequeue operation. 

16. An article comprising a computer-readable medium 
that stores computer-executable instructions for causing a 
computer system to: 

reference a queue descriptor stored in a cache in a 
processor's memory controller logic, in response to 
receiving a request to perform an enqueue or dequeue 
operation with respect to a particular queue, the queue 
descriptor specifying the structure of the queue 

17. The article of claim 16 comprising instructions for 
causing the computer system to: 

maintain in a content addressable memory a list of 
addresses of a subset of queue descriptors stored in a 
memory. 

18. The article of claim 17 comprising instructions for 
causing the computer system to: 

store in the cache a queue descriptor corresponding to 
each address in the list. 

19. The article of claim 18 comprising instructions for 
causing the computer system to: 

track an address in the content addressable memory, the 
address corresponding to a queue descriptor that was 
least recently used for an enqueue or dequeue opera- 
tion. 



20. The article of claim 19 comprising instructions for 
causing the computer system to: 

remove the least-recently-used address from the list if the 
list lacks an entry corresponding to the queue specified 
by the request; and 

replace the removed address with an address correspond- 
ing to the specified queue. 

21. The article of claim 18 comprising instructions for 
causing the computer system to: 

issue commands to the memory controller logic to return 
and fetch queue descriptors to and from the memory to 
maintain coherence between the queue descriptors in 
the cache and the list of addresses in the content 
addressable memory. 

22. The article of claim 21 comprising instructions for 
causing the computer system to: 

return a queue descriptor modified by an enqueue or 
dequeue operation from the cache to memory. 

23. The article of claim 16 comprising instructions for 
causing a computer system to: 

execute an enqueue operation without waiting for comple- 
tion of a previous dequeue operation if the queue would 
otherwise be unempty upon completion of the dequeue 
operation. 

• * * * * 
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